3574 results found.
Written
Corpus,
Language Type:
Multilingual
Languages:
English
Availability:
From Data Center(s)
License:
<Not Specified>
Size:
157M Production Status:
Existing-used
Use:
Summarisation
Paper:
N/A
Documentation:
NIST
Written
Lexicon,
Language Type:
Multilingual
Languages:
English french
Availability:
Freely Available
License:
<Not Specified>
Size:
<Not Specified> Production Status:
Existing-used
Use:
Machine Translation, SpeechToSpeech Translation
Paper:
N/A
Documentation:
<Not Specified>
Speech
Typological Database,
Language Type:
Multilingual
Languages:
English
Availability:
Freely Available
License:
GPL v3 + CC BY 4.0 + CC BY-SA 3.0 + PNAS
Size:
80 MByte Production Status:
Newly created-in progress
Use:
Information Extraction, Information Retrieval
Paper:
N/A
Documentation:
Yes, in the ReadMe.mdLanguage Type:
Multilingual
Languages:
English
Availability:
From Data Center(s)
License:
LDC
Size:
11GB Production Status:
Existing-used
Use:
Information Extraction, Information Retrieval
Paper:
N/A
Documentation:
<Not Specified>
Written
Corpus,
Language Type:
Multilingual
Languages:
English
Availability:
Freely Available
License:
OpenSource
Size:
9.1 MByte Production Status:
Existing-used
Use:
Question Answering
Paper:
N/A
Documentation:
<Not Specified>
Written
Evaluation Data,
Language Type:
Multilingual
Languages:
English french
Availability:
Freely Available
License:
OpenSource
Size:
740 Production Status:
Existing-updated
Use:
Machine Translation, SpeechToSpeech Translation
Paper:
N/A
Documentation:
Yes, French, publicly availableLanguage Type:
Multilingual
Languages:
English Spanish
Availability:
Freely Available
License:
CreativeCommons
Size:
<Not Specified> Production Status:
Existing-used
Use:
Machine Translation, SpeechToSpeech Translation
Paper:
N/A
Documentation:
http://www.lrec-conf.org/proceedings/lrec2012/pdf/370_Paper.pdf
Written
Corpus,
Language Type:
Multilingual
Languages:
English
Availability:
Freely Available
License:
<Not Specified>
Size:
372K gzipped Production Status:
Existing-used
Use:
Information Extraction, Information Retrieval
Paper:
N/A
Documentation:
<Not Specified>
Written
Corpus,
Language Type:
Multilingual
Languages:
English Spanish
Availability:
Freely Available
License:
none
Size:
169 MB, 1,689,850 parallel sentences Production Status:
Existing-used
Use:
Machine Translation, SpeechToSpeech Translation
Paper:
N/A
Documentation:
Europarl: A Parallel Corpus for Statistical Machine Translation, Philipp Koehn, MT Summit 2005. http://www.iccs.inf.ed.ac.uk/~pkoehn/publications/europarl-mtsummit05.pdfLanguage Type:
Multilingual
Languages:
Bengali English
Availability:
From Owner
License:
CreativeCommons
Size:
1 GB Production Status:
Newly created-in progress
Use:
Machine Translation, SpeechToSpeech Translation
Paper:
N/A
Documentation:
English




